Reference: Biol. Bull. 205: 8-15. (August 2003) 
© 2003 Marine Biological Laboratory 


250 Million Years of Bindin Evolution 


KIRK S. ZIGLER'** AND H. A. LESSI1OS! 


: . . . z 2 . 
' Smithsonian Tropical Research Institute, Balboa, Panamá; and ^ Department of Biology, 
Duke University, Durham, North Carolina 


Abstract. Buindin plays a central role in sperm-egg attach- 
ment and fusion in sea urchins (echinoids). Previous studies 
determined the DNA sequence of bindin in two orders of the 
class Echinoidea, representing 10% of all echinoid species. 
We report sequences of mature bindin from five additional 
genera, representing four new orders, including the distantly 
related sand dollars, heart urchins, and pencil urchins. The 
six orders in which bindin is now known include 70% of all 
echinoids, and indicate that bindin was present in the com- 
mon ancestor of al] extant sea urchins more than 250 milhon 
years ago. Over this span of evolutionary time there has 
been (1) remarkable conservation in the core region of 
bindin, particularly in a stretch of 29 amino acids that has 
not changed at all; (2) conservation of a motif of basic 
amino acids at the cleavage site between preprobindin and 
mature bindin: (3) more than a twofold change in length of 
mature bindin; and (4) emergence of high variation in the 
Sequences outside the core, including the insertion of gly- 
cine-rich repeats in the bindins of some orders, but not 
others. 


Introduction 


Various studies have shown that molecules involved in 
reproduction (and particularly in gamete interactions) 
evolve rapidly, often under the influence of positive selec- 
tion (reviewed in Swanson and Vacquier, 2002). Among 
these proteins there are examples of both high (Metz and 
Palumbi, 1996) and low (Metz ef al., 1998b) levels of 
intraspecific variation. In some cases a single molecule 
displays domains that are highly conserved and other do- 
mains that are highly variable (Vacquier et al., 1995). Vari- 
ation in such proteins 1s usually studied at a low taxonomic 
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level, often within species, sometimes within genera, but 
rarely across an entire class. There are good reasons for this 
focus: such studies are likely to uncover mutationa] changes 
that are important in mate recognition and in speciation. 
However, comparisons across broad taxonomic levels can 
offer insights into the evolution of such molecules. They can 
reveal which features of these molecules are conserved (and 
are thus essential for basic functions) and which features are 
free to vary. For the parts that do vary, such comparisons 
can determine common features of evolution. Most of all, 
the comparisons can address the question of the universality 
of a particular molecule by asking how far back in evolution 
one needs to search to find the point at which a completely 
different molecule has taken over the essential functions 
involved in gamete binding and fusion. 

Echinoids (sea urchins, heart urchins, and sand dollars), 
with their readily obtainable gametes, have long been model 
organisms for fertilization studies. Because fertilization is 
external, the molecules involved in gamete recognition and 
fusion are associated exclusively with the gametes. Bio- 
chemical studies in sea urchins identified the first “gamete 
recognition protein.” bindin (Vacquier and Moy, 1977). 
Bindin is the major insoluble component of the sperm 
acrosomal vesicle and has been implicated in three molec- 
ular interactions (Hofmann and Glabe, 1994). First, after the 
acrosomal] reaction, bindin self-associates, coating the acro- 
somal process. Second, it functions in sperm-egg attach- 
ment by binding to carbohydrates in the vitelline layer on 
the egg surface. Third, it is involved in the fusion of sperm 
and egg membranes (Ulrich et al., 1998, 1999). 

Bindin is translated as a larger precursor, from which the 
N-terminal preprobindin portion is subsequently cleaved to 
produce mature bindin (Gao et al., 1986). The mature bindin 
molecule contains an amino acid core of about 55 residues 
that is highly conserved among all bindins characterized to 
date (Vacquier et al., 1995). An 18 amino acid section of 
this conserved core (B18) has been shown to fuse lipid 
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vesicles in vitro, suggesting that this region functions in 
sperm-egg membrane fusion (Ulrich et al., 1998, 1999). 
Thus far, bindin is known only trom echinoids: no homol- 
ogous molecules have been identified in any other organism 
(Vacquier. 1998). 

To date. the nucleotide sequence of bindin has been 
determined in six genera of sea urchins. In Echinometra 
(Metz and Palumbi, 1996), Strroneylocentrotus (Gao et al., 
1986: Minor et al., 1991; Biermann. 1998; Debenham et al., 
2000), and Heliocidaris (Zigler et al., 2003), there are many 
sequence rearrangements among individuals and species. 
and indications of positive selection in regions on either side 
of the core. In Arbacia (Glabe and Clark, 1991; Metz et al., 
1998a) and Tripneustes (Zigler and Lessios. 2003). there are 
fewer sequence rearrangements and no evidence for positive 
selection. In Lytechinus, only one sequence has been pub- 
lished (Minor et al., 1991), so the mode of evolution of the 
molecule remains unknown. 

The five genera in which bindin was previously se- 
quenced belong to two echinoid orders. the Echinoida and 
the Arbacioida. These two orders contain only 10% of all 
extant echinoid species (Kier, 1977; Smith, 1984; Little- 
wood and Smith, 1995). The molecular structure of bindin 
in the other 13 orders of the class Echinoida has not been 
studied. The only evidence that bindin is present outside the 
Echinoida and Arbacioida comes from Moy and Vacquier 
(1979), who reported that an antibody to bindin of Strongy- 
locentrotus purpuratus reacted with sperm trom one species 
of the order Phymosomatoida and two species of the order 
Clypeasteroida. As Vacquier (1998) has pointed out, mole- 
cules that mediate fertilization—in contrast to those central 
to other basic life processes—olten differ between taxa. For 
example. in the molluscan class Bivalvia. completely dit- 
ferent proteins are involved in gamete recognition of oysters 
(Brandriff et al., 1978) and of mussels (Takagi er al., 1994). 
It is, therefore. not safe to assume without empirical evi- 
dence that bindin is present in all orders of echinoids, or that 
it has the same general structure as in the taxa in which it 
has already been characterized. 

As a first step in determining which orders of echinoids 
possess bindin and, if they do, how its structure varies. we 
cloned and sequenced mature bindin from five genera of sea 
urchins, four of which belong to orders in which bindin was 
previously unknown. We combined our data with those of 
previous studies of bindin in genera belonging to the orders 
Echinoida and Arbacioida. The final data set includes bindin 
from 10 genera of sea urchins, pencil urchins, sand dollars, 
and heart urchins, and the results indicate that the molecule 
was present in the common ancestor of all extant echinoids 
that diverged from each other over 250 million years ago. 
The core sequence has remained remarkably unchanged 
over this period of time. whereas the areas flanking the core 
have undergone substantial modification. resulting in great 


differences in molecular size, amino acid sequence, and 
number of repeats. 


Materials and Methods 
Samples 


The pencil urchins (order Cidaroida) were represented in 
our study by Eucidaris tribuloides, collected on the Atlantic 
coast of Panama; the order Diadematoida by Diadema an- 
rillarum, also from the Atlantic coast of Panama. The sand 
dollars (order Clypeasteroida) were represented by Encope 
stokesti from the Pacific coast of Panama: the heart urchins 
(order Spatangoida) by Moira clotho collected at the Perlas 
Islands in the Bay of Panama. Heltocidaris eryvthrogramma 
(order Echinoida) was collected near Sydney, Australia. 


DNA tsolation and sequencing 


We injected various individuals of each species with 0.5 
M KCI until we encountered one that produced sperm. The 
testes of this ripe male were removed and used either 
directly for mRNA extraction, or after preservation in either 
RNALater (Ambion Inc.) or in liquid nitrogen. The methods 
for mRNA isolation, reverse transcription reactions, initial 
polymerase chain reactions, 3’ and 5’ rapid amplification of 
cDNA ends (RACE) reactions, and DNA sequencing were 
as described in Zigler and Lessios (2003). with the follow- 
ing modifications. (1) A fragment of the core region of 
bindin was amplified from the reverse transcriptase reac- 
tion product or from genomic DNA. using primers 
MB1130+ (5'-TGCTSGGTGCSACSAAGATTGA-3’) and 
either core200- (5’-TCYTCYTCYTCYTGCATIGC-3’) or 
coreI57- (5’-CIGGRTCICCHATRTTIGC-3’). These prim- 
ers correspond to amino acids VLGATKID, ANIGDP, and 
AMQEEEE. respectively (Vacquier et al., 1995). (2) When 
complete 5’ mature bindin sequences were not obtained 
during the first round of 5’ RACE, new primers were 
designed at the 5’ end of the obtained sequence; then a 
second round of RACE amplification was conducted. (3) A 
5’ preprobindin primer was designed based on a comparison 
of preprobindin sequences of Moira clotho (this study) to 
preprobindin sequences of Arbacia (Glabe and Clark. 1991), 
Strongyvlocentrotus (Gao et al., 1986; Minor et al., 1991), and 
Lytechinus (Minor et al.. 1991). This primer, prol80 (5’- 
AAGMGIKCIAGYSCIMGIAAGGG-3’), which corresponds 
to the conserved amino acids KR(A/S)S(A/P)RKG of the 
preprobindin, was used in combination with exact primers 
from the bindin core to amplify mature bindin sequences 5’ 
of the core from Eucidaris tribuloides testis CDNA. (4) 
Bindin sequences obtained from RACE were subsequently 
confirmed by amplification, cloning. and sequencing of full 
mature bindin sequences from testis cDNA. 

Sequencing of both DNA strands was performed on 
an ABI 377 automated sequencer, and sequences were 
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edited using Sequencher 4.1 (Gene Codes Corp.). Se- 
quences have been deposited in GenBank (Accession num- 
bers AY 126482-A Y 126485. AF530406). Published mature 
bindin sequences from a single exemplar from each of 
the five genera in which bindin had been previously se- 
quenced were taken from GenBank. These representatives 
were Strongylocentrotus purpuratus (Accession number: 
M14487, Gao et al., 1986), Lytechinus variegatus (M59489, 
Minor et al., 1991), Arbacia punctulata (X54155, Glabe and 
Clark, 1991), Echinometra oblonga (U39503, Metz and 
Palumbi, 1996), and Tripneustes ventricosus (AF520222, 
Zigler and Lessios, 2003). Three amino acids of the core 
region of the bindin of Lytechinus variegatus [numbers 367 
(N), 368 (L), and 385 (Y) in the alignment of Vacquier et 
al., (1995)] were changed to A, V, and D, respectively, 
based on our own sequence data of Lytechinus bindin from 
25 individuals representing 5 species; all 25 sequences had 
these amino acids at the 3 sites (Zigler and Lessios, unpub.). 
In Echinometra oblonga, sequences for the extreme 3’ end 
of preprobindin are not in GenBank. They were inferred 
from the primer sequences used by Metz and Palumbi 
(1996) to amplify mature bindin sequences. 


Sequence analysis 


We aligned the mature bindin amino acid sequences with 
ClustalX ver. 1.81 (Thompson et al., 1997), and adjusted the 
alignment by eye in Se-A1 (ver. 2.0a5, Rambaut, 1996). We 
characterized the amino acid changes observed in the core 
region of bindin as either radical or conservative with re- 
spect to charge and polarity (Taylor, 1986; Hughes er al., 
1990). The PROTPARAM tool of the EXPASY proteomics 
server of the Swiss Institute for Bioinformatics (http://www. 
expasy.org) was used to calculate Kyte and Doolittle (1982) 
hydrophobicity plots (window size = |] amino acids) for 
each mature bindin sequence. The PROTSCALE tool of the 
Same server was used to calculate amino acid composition 


for the mature bindins both for the core region (10 se- 
quences, 55 amino acids per sequence) and for mature 
bindin sequences outside the core (10 sequences of varying 
length for a total of 1909 amino acids). The program 
CODONS (Lloyd and Sharp, 1992) was used to calculate 
the effective number of codons (ENC), a measure of codon 
usage bias (Wright, 1990), for each sequence. ENC values 
can range from 20 to 61, with 61 indicating that all synon- 
ymous codons are used in equal frequency (no codon bias), 
and 20 indicating that only a single codon is used for each 
amino acid (maximum codon bias). The statistical analysis 
of protein sequences (SAPS, http://www. isrec.isb-sib.ch/ 
software/SAPS_form.html) program was used to identify 
separated repeats, simple tandem repeats, and periodic re- 
peats in each mature bindin sequence (Brendel et al., 1992). 


Results and Discussion 


Figure ] shows the phylogenetic relationships among the 
echinoid orders from which bindin was sequenced, as they 
have been reconstructed from molecular, morphological, 
and fossil evidence (Littlewood and Smith, 1995; Smith et 
al., 1995). As the figure indicates, bindin is present not only 
in the Echinoida and the Arbacioida (from which it was 
previously known), but also in the sand dollars (Clypeas- 
teroida) and the heart urchins (Spatangoida), as well as the 
phylogenetically much more distant Diadematoida and Ci- 
daroida. Along with the sequence of Heliocidaris, reported 
in this paper, and the previously known sequences from 
Arbacia, Strongylocentrotus, Tripneustes, Lytechinus, and 
Echinometra, the data set covers orders that contain more 
than 70% of all extant echinoid species (Kier, 1977). The 
Cidaroida, the only extant order of the subclass Perischo- 
echinoidea, is the lineage most divergent from all other 
echinoids. It was separated from the Euechinoidea approx- 
imately 250 mya. Bindin’s presence in both extant sub- 
classes of the Echinoidea indicates that it was present in 


Millions of years ago Species Order Source 
reee re 

a = - BPE rachis enone. Cidaroida this study 
Diadenia ontillarum Diadematoida this study 
Encope stokesii Clypeasteroida this study 
Moira clotho Spatangoida this study 
Arbacia punctulata Arbacioida Glabe and Clark, 1991 
Strongylocentrotus purpuratus Echinoida Gao et al., 1986 
Tripneustes ventricosus Echinoida Zigler and Lessios, 2003 
Lytechinus variegatus Echinoida Minor et al., 1991 
Heliocidoris erythrogramma Echinoida this study 
Echinometra oblonga Echinoida Metz and Palumbi, 1996 

Figure 1. Phylogenetic relationships, divergence times, and systematic posilion of genera in which bindin 


has been sequenced. Echinoid phylogeny and divergence times are from Smith (1988) and Smith et al. (1995). 


Source of bindin sequence data is also indicated, 
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their common ancestor and that it has been evolving along 
each of the branches of the sea urchin phylogenetic tree for 
more than 250 my. Whether bindin is present in other 
echinoderms remains uncertain. Moy and Vacquier (1979) 
found that their antibody to Strongvlocentrotus purpuratus 
bindin did not react with sperm [rom three species of sea stars, 
and “zoo blots” using S. purpuratus bindin sequences to 
probe genomic DNAs of a sea cucumber and a sea star were 
negative (Minor et al., 1991). No attempt has been made to 
determine bindin’s presence in the ophiuroids or crinoids. 

Figure 2 indicates that the aligned mature bindin se- 
quences are a mosaic of highly conserved and highly diver- 











gent regions. Over the past 250 my, the 55 residues of the 
core (amino acids 155-209) have been remarkably con- 
served. This region does not contain any insertions or de- 
letions in any echinoid lineage. Of the 55 amino acids, 45 
are conserved across all of the 10 exemplars, including a 
strelch of 29 residues in a row (amino acids 164-192). The 
BIS sequence of 18 amino acids implicated in membrane 
fusion (Ulrich et al.. 1998, 1999) is part of this pertectly 
conserved section. Seven amino acid sites in the core region 
exhibit a singleton amino acid change (i.e., a change found 
in only one of the sequences). Four of these changes are 
conservative with respect to charge and polarity (amino 
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Figure 2. 
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Mature bindin amino acid alignment. The first four amino acids are the presumed cleavage site 


from preprobindin. Sites at which amino acids are identical in more than 50% of Ihe genera are enclosed in 
boxes. Conservation across all 10 genera is indicated by an asterisk below the site. Dashes indicate deletions. The 
site for which an intron is known to exist in Echinometra, Arbacia, Strongvlocentrotus, Lytechinus, Tripneustes, 
Heliocidaris, and Diadema is indicated by an “1” under the alignment. The core region is shaded. The B18 region 
of the core is indicated by a bar beneath the alignment. Sites in the core where radical amino acid changes have 
occurred are marked with an ‘R` under the alignment; sites in the core where only conservative amino acid 
changes have occurred are marked with a ‘C’. Stop codons are indicated by an asterisk after the last amino acid. 
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acids at positions 155, 157, 164, and 208), and three are 
radical (positions 193, 194, and 200). Each of positions 196, 
199, and 203 contain three amino acids across the 10 
genera, indicating that there have been at least two changes 
at each of these sites. At least one of the changes at each site 
must have been a radical change. Thus, radical changes are 
observed in only six amino acid positions of the core region, 
all of them concentrated in a small portion of the core close 
to the C terminus (amino acids 193, 194, 196, 199, 200. and 
203). The rest of the core (amino acids 155 through 192 and 
204 through 209) contains only four conservative singleton 
amino acid substitutions. 

A second conserved region is the cleavage site at the 
border between preprobindin and mature bindin (Fig. 2). In 
Strongylocentrotus purpuratus, the cleavage site is marked 
by a motif of four basic amino acids (RKKR) (Gao et al., 
1986). Multibasic motifs are also present in the other nine 
genera (Fig. 2). Such multibasic motifs typically mark the 
cleavage sites of proproteins from the mature molecule 
during the secretory process through the action of propro- 
tein convertases (Steiner, 1998; Seidah and Chretien, 1999). 
The conservation of this multibasic motif in bindin rein- 
forces the idea that it functions as a signal for the cleavage 
of preprobindin from mature bindin in all echinoids. 

In contrast to the core and to the cleavage site, the rest of 
the molecule is so variable between orders that we have 
little confidence that the alignment of these regions depicted 
in Figure 2 is correct. There is a great amount of variation 
in the length of mature bindin both on the 5’ and on the 3’ 
side of the molecule (Table 1). This study identifies both the 
longest and the shortest bindins described to date. Bindin in 
Diadema antillarian (418 amino acids) is more than twice 
as long as bindin in Encope stokesii (193 amino acids). 
Bindin length 5’ of the core ranges from 78 to 148 amino 
acids, while bindin length 3’ of the core ranges from 56 to 
215 amino acids. There seems to be no discernible evolu- 
tionary trend in bindin length. Closely related orders do not 


Table 1 


Number of amino acids in three regions of the mature bindin in 10 


genera 

5 Core 3’ Toial 
Eucidaris {01 55 60 216 
Diadema t48 DA, 215 418 
Encope 82 53 56 193 
Moira t38 55 94 287 
Arbacia 105 55 73 233 
Lytechinus 103 59 60 218 
Tripneustes 88 55 68 211 
Strongylocentrotus 82 59 99 236 
Heliocidaris 78 55 73 206 


Echinometra tli 55 T5 24] 


5’ and 3’ regions are defined relative to the conserved core. 


tend to have bindins that are of similar length. Indeed, it 
cannot be assumed that the genera we have included in the 
study are representative of their orders in this respect. The 
regions on either side of the core were found to confer 
species-specificity in Strongylocentrotus (Lopez et al., 
1993). If their variation reflects the requirements of this 
function, they can be expected to vary in a phylogenetically 
unpredictable fashion. 

An intron located at a conserved position just 5’ of the 
core region has been identified in the mature bindins of 
Echinometra (Metz and Palumbi, 1996), Arbacia (Metz et 
al, 1998a), Strongylocentrotus (Biermann, 1998), and Tri- 
pneustes (Zigler and Lessios, 2003). In each of these genera, 
the intron is located at a conserved valine (amino acid 150 
in Fig. 2). Comparison of sequences derived from both 
cDNA and from genomic DNA in Heltocidaris (Zigler et 
al., 2003), Lytechinus, and Diadema revealed that in these 
genera the intron also exists at the same location and that its 
point of insertion is also a valine. We have made no attempt 
to amplify bindin from genomic DNA in Eucidaris, Moira, 
and Encope, so we do not know whether this intron is a 
universal feature of all bindins. These three genera do not 
have a valine in the site at which the intron is known to exist 
in the others, but the significance of this pattern cannot be 
evaluated with the present data. 

Previous studies have identified both bindins with gly- 
cine-rich repeat structures and bindins that lack such struc- 
ture. Glycine-rich repeats were found in the bindins of 
Lytechinus (Minor et al., 1991), Strongylocentrotus (Bier- 
mann, 1998), Echinometra (Metz and Palumbi, 1996), and 
Tripneustes (Zigler and Lessios, 2003), all members of the 
order Echinoida. Consistent with the phylogenetic position 
of Heliocidaris, its bindin also contains glycine-rich repeat 
sequences, with MGGGN and VGGGGP on the 5’ side of 
its core, and the series MGGG-MGGGGP-MGGGGP- 
MGGGGM-MGFQG-MGGOQPP on the 3’ side. Although 
Moira belongs to a different order, its bindin also contains 
extensive glycine-rich repeats, with the sequence PGGGL- 
PSGGL-AGGGL-PVGGL-AGGGL-PVGGL-AGGGF- 
PGGGL-QGGGF-QGGGL-PGGGG found 5’ of the core. 
Glabe and Clark (1991) noted that bindin from Arbacia 
punctulata Jacked significant repeat structure, and this ob- 
servation was extended to three other species of Arbacia 
(Metz et al., 1998a). Eucidaris, Encope, and Diadema re- 
semble Arbacia in containing only minimal tandem or sep- 
arated repeats, the longest of which 1s PAAP-PPAP-PAAP 
in the region flanking the 5° side of the core in Eucidaris. 
Thus, glycine-rich repeat structure remains a common trait 
of the bindin of the Echinoida, although, as the data from 
the spatangoid Moira indicate, it is not a characteristic 
limited to this order or even to a closely aligned clade. 

There are no cysteine or tryptophan residues in any 
mature bindin. Disulfide bonds formed between cysteine 
residues are often critical for protein structure, and in rap- 
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idly evolving proteins—such as toxins of cone snails (Duda 
and Palumbi, 1999) and pheromones of the marine ciliate 
Euplotes (Luporini et al., 1995)— cysteine residues are of- 
ten among the most conserved amino acids, serving as 
guides for aligning sequences. Thus, the lack of cysteine 
residues in bindin may have important structural conse- 
quences. When all sequences are pooled, glycine is by far 
the most common amino acid outside the core, constituting 
nearly a quarter of all residues. If the orders that possess 
glycine-rich repeats (Echinoida and Spatangoida) are sepa- 
rated from those that do not, glycine remains the most 
common amino acid in both categories, constituting 29.6% 
of the non-core amino acids in the former and 16.4% of 
non-core residues in the latter. The six most common resi- 
dues outside the core (G, A, P. Q, N. and E) compose 63.9% 
of all non-core residues. Leucine is the most common amino 
acid in the core, present in 10 completely conserved amino 
acid positions, including 6 of the 18 amino acids in the BIS 
region. There is a much higher proportion of charged resi- 
dues in the core (31.8%) than in the rest of the molecule 
(15.6%). Each of the five charged amino acids (E. D. RTH. 
and K) is more common tn the core. 

Another common feature of all bindins is their lack of 
codon usage bias. ENC values among the 10 genera range 
from 6] (for Eucidaris and Diadema) to 48.) (for Arbacia). 
with an average of 56.4. Low levels of codon usage bias 
have also been observed in sex-related genes in Drosophila 
(Civetta and Singh, 1998) and in the Chlamydomonas mat- 
ing-type locus genes Mid and Fus] (Ferris et al., 2002). 

Given the large divergence in amino acid sequence and 
length (and the uncertainties in alignments), it is not sur- 
prising that hydrophobicity plots (Fig. 3) from these bindins 
are diverse. The conserved amino acid sequence of the core 
and its flanking regions causes all plots to be similar through 
the middle of the molecule. Plots of the closely related 
Tripneustes ventricosus, Lytechinus variegatus, Helioci- 
daris erythrogramma, and Echinometra oblonga bindins are 
similar throughout their lengths. The rest of the hydropho- 
bicity plots are not clearly similar. One particularly distinct 
region is the long hydrophilic stretches in Diadema bindin 
along its extended length. A second is the highly hydropho- 
bic region 3’ of the core of Arbacia bindin, noted by Glabe 
and Clark (1991), 

The only other gamete recognition protein that has been 
studied in marine invertebrates separated for as long as 250 
my is the gastropod sperm protein lysin. Lysin opens a hole 
in the vitelline envelope of free-spawning snails and thus 
enables sperm to penetrate to the plasma membrane of the 
egg. It has been studied in the abalones (Haliotis) (Lee and 
Wacquier, 1992: Cee ef al, 1995: Yang ef al.. 2000; re- 
viewed in Kresge er al., 200!) and in two genera of turban 
snails, Tegula and Norrisia (Hellberg and Vacquier, 1999). 
Abalones and turban snails diverged 250 mya. roughly the 
same time the cidaroids separated from the euechinoids. The 
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Figure 3. 
genera are presented in the same order as in Figure | and abbreviated as 
follows: E.t.. Eucidaris tribuloides, D.a.: Diadema antillarum, E.s.: En- 
cope stokesii, M.c.: Moira clotho, A.p.: Arbacia punctulata, S.p.: Strongy- 


Hydrophobicity plots of mature bindin in 10 genera. The 


locentrotus purpuratus, T.v.: Tripneustes ventricosus, L.y.: Lytechinus 
variegatus, H.e.2 Heliocidaris ervthrogranuua, and E.o.: Echinometra ob- 
longa. A box encloses the core, and a bracket indicates the hydrophobic 
region in Arbacia. The scale bars above and below each zero line mark +1 
and —1, respectively. 


additional bindin sequences reported here reinforce the con- 
clusions of Hellberg and Vacquier (1999) from comparisons 
between the modes of evolution of these two proteins. 
Although they are both involved in gamete recognition and 
both lack cysteine residues, they evolve in different fash- 
ions. There is no equivalent of a bindin core region in lysin: 
amino acid substitutions are spread throughout the mole- 
cule, with only three amino acids conserved between all 
Haliotis species and the two teguline genera. Instead of 
conserving a section of the molecule, lysin has maintained 
its function by conserving secondary structure through con- 
servative amino acid substitutions (Hellberg and Vacquier, 
1999), Length variation is another obvious difference. Ma- 
ture bindin length varies from 193 to 418 amino acids, but 
lysin length (at least in the two groups studied to date) only 
from 126 to 138 amino acids. 
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Conclusions 


The comparisons of bindin from 10 genera of echinoids 
reveal the results of long-term evolutton under two oppos- 
ing selective forces acting on gamete recognition molecules. 
The sections of the molecule involved in the basic functions 
of gamete fusion and post-translalional cleaving of the 
preprobindin have been remarkably conserved over 250 my 
of evolution, presumably through purifying selection. The 
sections involved in species recognition have been evolving 
rapidly in seemingly unpredictable directions, presumably 
under diversifying selection; such changes are likely to be 
specific lo each species. 

A number of features identified by these compartsons are 
in need of functional explanations. Among the conserved 
features, the lack of change in the core region is the only one 
that can be easily explained. We do not yet know whether 
there is a particular reason for the low codon usage bias of 
all bindins, for the absence of tryptophan or cysteine resi- 
dues, or for the absence of major hydrophobic regions in all 
bindins except that of Arbacia. The differences between the 
orders are equally puzzling. Is there a functional reason for 
the length variation of the regtons outside the core? Why do 
the Echinoida and the Spatangoida have glycine-rich repeals 
in the regions flanking the core, while other orders do not? 
Comparisons alone cannot provide answers to these ques- 
tions; but they can identify features of the molecule thal are 
worthy of functional study. 
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